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ABSTRACT 



This study examines a common set of equating items 
for their appropriateness to both American and Canadian populations 
and, to determine the equations for converting scores from the verbal 
and mathematical scales of the 1971 form of the Canadian Scholastic 
Aptitude Test (the future standard continuing scales of the CSAT) to 
the corresponding standard continuing scales of the Scholastic 
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Intr oduc t i on 



The scores earned on the verbal and mathematical sections of* the College 
Board Scholastic Aptitude Test (SAT) are reported on a scale that extends from 
200 to 800 , a scale originally established by setting the mean and standard 
deviation of candidates tested in April 19^1 at the arbitrary values of 500 
and 100, respectively. The transformation from raw (formula) scores to scaled 
scores is a linear one, which preserves all moments of the raw score distribu- 
tion above the second, that is, it retains the shape of the raw score distribu 
tion. The SAT scale is a continuing scale; scores on each form of the test 
are equated to scores on the preceding forms with the effect that, within the 
limits of equating error, a score of 5 ^ 3 , for example, earned on any one form 
of SAT -verbal has the same meaning -- represents the same level of ability -- 
as a score of 5^3 on any other form of SAT-verbal, 

The scores earned on the verbal and mathematical sections of the Canadian 
Scholastic Aptitude Test (CSAT) of the Service for Admission to College and 
University (SACU) are also reported dn a 200-800 scale. Here, too, the trans- 
formation from raw (formula) scores to scaled scores is a linear one. Unlike 
the SAT-verbal scale, however, which is equated back to April 19^1, the 1971 
linear scale of the CSAT -verbal was defined on the form actually administered 
in January 1971 by setting the minimum raw score at 200 and the maximum raw 
score at 800. The two earlier forms of the CSAT -verbal (introduced in 1969 
and. in 1970 ) were scaled in the same way, independently for each form and not 
equated to each other. However, scores on all future CSAT -verbal forms will 
be equated to the CSAT -verbal scale for January 197 1?. Thus, the January 1971 
scale becomes the basic reference scale for CSAT -verbal and corresponds 
directly to the April 19^1 reference scale used for SAT-verbal. 

Statements similar to these Just made for the verbal tests are also 
appropriate for SAT -mathematical and CSAT -mathematical, respectively, 
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Although both the CSAT and the SAT are very similarly designed tests of 
verbal and mathematical aptitude, and although their scales are similar in 
appearance, it is clear from the foregoing, and also from the fact that no 
effort has ever been made to link the two systems -together, that equal- 
appearing scores in the two systems -- say, again, 565 on CSAT-verbal and SAT- 
verbal -- might have quite different meanings and reflect quite different 
levels of ability. Accordingly, the present study was undertaken to examine a 
common set of equating items for their appropriateness to both populations and, 
following that examination, to determine the equations for converting scores 
from the verbal and mathematical scales of the 1971 form of the CSAT (the 
future standard continuing scales of the CSAT) to the standard continuing 
verbal and mathematical scales of the SAT. 

Method 

The test forms used in the present equating study were the form of the 
SAT administered on January 9.? 1971 and the form of the CSAT administered on 
January 20, 1971. As an introduction to the design of the study it will be 
useful to review the administrative arrangements and content of the two tests. 
Both tests were administered in five separately timed sections: two operational 

verbal sections, two operational mathematical sections, and one nonoperational 
section administered in a time slot reserved for the presentation of pretest or 
equating items. Table 1 describes the item lengths and timing of the two tests 
and the order of appearance of the sections in the forms of the tests studied 
lie re . 

The SAT -verbal contains four item types : antonyms, analogies, sentence 

completion, and reading comprehension. Typically, all four item types are 
represented in both of the timed sections of the SAT. The CSAT-verbal items 
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are of the same type as those in the SAT. Unlike the SAT, however^ one of the 
two separately timed verbal parts of the CSAT contains exclusively discrete 
items; the other contains exclusively reading comprehension items. 

Table 1 

Administrative Arrangement and Content 
of the CSAT and the SAT 







SAT 








CSAT 




Section 


No. of 
Items 


Timing 
(Min. ) 


Section 


No . of 
Items 


Timing 
(Min. ) 


I 


Verbal 


ko 


50 


I 


Math 


20 


50 


II 


Verbal 


50 


45 


II 


Verbal 


55 


50 


III 


Nonop nl 


Variable 


50 


III 


Nonopnl 


Variable 


50 


IV 


Math 


55 


45 


IV 


Math 


20 


50 


V 


Math 


25 


50 


V 


Verbal 


20 


50 



Except for the data sufficiency items (the SAT has l8; the CSAT has none) 
the mathematical items in the SAT and CSAT are quite similar. The timing, 
however, is different. The SAT gives 1 l/4 minutes per mathematical item; the 
CSAT gives 1 l/2 minutes . 

As indicated above, the nonoperationai section of both tests (the third 
time slot in both the SAT form and the CSAT form used in this study) is used 
for pretesting items and also, in the SAT, for equating successive forms of 
the tests. In any formal administration of either test more than one non- 
operationai section may be administered in that time slot, with different 
students talcing different nonoperationai sections. In order to assure that 
randomly equivalent groups of students take one such section or another, .the 
test booklets are distributed in "spiralled" arrangement. Thus, if m non- 

V ' '-'5 " 
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operational sections are "to be administered, every mth student takes the same 

;■ such section. In the administration of the SAT, for example, it is usual for 

;; 

| m to be as high as 25 or 50 . 

| In preparation for the present study arrangements were made between the 

Service for Admission to College and University (SACU) and the College Entrance 



Examination Board ( C EE B ) to exchange 50-minute operational sections and for each 
of the two organizations to administer the sections of the counterpart organiza- 
tion in its own nonoperational position. Thus, in addition to the usual 50- 
minute pretests and 50 -minute equating sections normally assigned to the 
nonoperational position in the January 1971 SAT, Educational Testing Service 
(ETS) also administered (in behalf of CEEB) one of the two 50-minute operational 
CSAT-verbal sections and one of the two 50-minute operational CSAT -mathematical 
sections taken from the January 1971 CSAT. Correspondingly, in addition to the 
usual 50 -minute pretest sections normally assigned to the nonoperational posi- 
tion in the January 1971 CSAT, SACU also administered the 50-minute operational 
SAT -verbal section and the 50-minute operational SAT -mathematical section taken 
from the January 1971 SAT. 

With this arrangement a f ull 60 minutes of testing material in the verbal 
area — representing a total ox* 95 items — was ad min istered for purposes of 
equating to subsamples of both populations in connection with their operational 
testing. Thirty minutes of this material (10 items) was operational in the SAT 
(Section i) — i.e., contributing to the student's formal score — and nonoper- 
ational in the CSAT; thirty additional minutes (55 items), was operational in 
the CSAT (Section II) and nonoperational in the SAT.. Under the same sort of 
arrangement a full 60 minutes of testing material in the mathematical area 
representing a total of ^5 items -- -was administered, for the purposes of 
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equating, to subsamples of* both populations in connection with their operational 
testing. Thirty minutes of* this material (25 items) was operational in the SAT 
(Section V) and nonope rational in the CSAT; thirty additional minutes (20 items) 
was operational in the CSAT (Section i) and non operational in the SAT. As 



tests. The operational SAT -verbal section is referred to as SAT-Va and the 
operational CSAT -verbal section is referred to as CSAT-Vc. The total of* the 
two sections is referred to as Vt« Corresponding notation is used in the 
SAT- and CSAT -mathematical tests: ' Ma v Me, and Mt. 

The mathematical sections that were exchanged between the two organiza- 
tions were quite similar in content, representing a mixture of arithmetic 
reasoning, algebra, and geometry. The CSAT -verbal section that was adminis- 
tered along with the SAT contained sentence completion items, analogies and 
opposites only. The SAT -verbal section administered along with the CSAT 
contained these same item types, but in addition, included reading comprehen- 
sion items as well. 

The procedure followed in developing the conversions between the two 
testing systems called for two general steps: l) an examination of the 95 

common items in the verbal area and the ^5 common items in the mathematical 
area in order to identify items that appeared to have somewhat "different 
meaning" for American and Canadian students, and to eliminate such items from 
the group of common items to be used for equating the two tests; 2) the devel- 
opment of the necessary statistics, based on the remaining items, for equating 
the verbal and mathematical sections of the two tests. 

Examination of Item Data 

In order to carry out the first of these two steps, two mutually exclu- 
O give random samples of the SAT candidates who took Vc were drawn (these 



indicated' above, all non ope rational sections appeared in Section III of both 
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candidates also look Va as pari of Ine operational SAT), and similarly, two 
mutually exclusive random samples of the CSAT candidates who took Va were 
drawn (these candidates also took Vc as part of the operational CSAT). Four 
additional samples were selected for mathematical, and in the same way: two 

mutually exclusive random samples of the SAT candidates who took Me were 
drawn (these also took Ma operationally), and similarly, two mutually exclu- 
sive random samples of CSAT candidates who took Ma were drawn (tnese also 
took Me operationally). Each of the individual American samples consisted of 
450 cases; the Canadian samples, however, ranged in size from 405 to 420. The 
four SAT samples are referred to as AVI, AV2, AMI, and AM2; the four corre- 
sponding CSAT samples are referred to as CVl, CV2, CM1, and CM2. In addition, 
samples AVT, CVT, AMT, and CMT were formed. (The lettez* *s A and C denote 
American and Canadian students respectively; the letters V and M denote tests 
for which these data were to be analyzed: verbal and mathematical; the 

numerals 1 and 2 are used only to distinguish between random subsamples of 
students drawn from the same population and taking the same battery of tests. 
The letter T denotes a combined sample; thus AVT = AVI + AV2. ) 

The first step in the analysis — the examination of the performance of 
the American and Canadian students on the seme items — involved the use of a 
technique that will bear detailed description. In comparing two samples of 
individuals in this way, an item analysis is carried out for each of-the two 
groups and a plot is made of the points represented by the paired A- values , 

A vs A^ *, one point for each of the items (i) under consideration for 

n i 

*Delta (A) is an index of item difficulty for an item. A = 4z + 15 * where z 
is a normal deviate corresponding to p, the proportion of examinees answering 
the item correctly. • 
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which A- values are available. The plot of these points normally appears in the 
form of an ellipse extending from lower left to upper right , and if the samples 
are drawn from the same type of population, the scatterplot of these points is 
a long, narrow one, often representing a correlation as high as .98 or -99- 
When the samples are different in level., the points will still fall in a long, 
nar row ellipse, but displaced vertically or horizontally, depending on which 
group is the abler one. Even when the groups differ in degree of dispersion, 
the points will still fall in the same type of ellipse , but the ellipse will 
be tilted at an angle more or less steeper than ^5° , depending on which sample 
is the more disx^ersed. However, when the groups are different in type , the 
item difficulties will not fall in precisely the same rank order for the two 
groups, and the correlation represented by the delta-points will be lower than 
.98 or .99. The items falling at some distance from the plot may be regarded 
as contributing to the item x group interaction. They are the items that are 
especially more difficult for one group than for the other, relative to the 
other items . 

The method developed for summarizing the significant features of each 
plot involves the determination of the major axis of the ellipse represented 
by the plot and the calculation of the perpendicular distance (D^) from each 
point to that line. The standard deviation of the distribution of these 
distances is a function of the item x group interaction. As indicated above, 
the correlation coefficient represented by the ellipse represents the degree 
to which the items have the same rank order of difficulty in the two groups 
also a representation (inversely) of the item x group interaction. 

The equation used for. the major axis of the ellipse is a linear one, 
h = Pg + Q, where 
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t d 



p = 



- 2 ) 

g 






h 



2,2 

“ O + 



,222 
4r . <j a, 
gh g h 



2r . a a, 
gh g h 



and 



Q = M h - PM 



(it is recalled that the variables, g and h, are, respectively, the delta 
values for the two groups under consideration. ) The formula for the perpen- 
dicular distance, D., of each point, i, in the plot to the line is given as: 



D. = 



Fg ± - h ± t Q 



A 



P c + 1 



In preparation for the examination of the delta plots, six item analyses 

•*<L 

were carried out for the verbal tests, ore for each of the four separate verbal 
samples (AVI, AV2, CVl, and CV2) and one for each of the combined samples (ATT 
and CVT). Similarly, six item analyses were carried out for the mathematical 
tests, one for each of the four separate mathematical samples (AMI, AM2, CM1, 
and CM2) and- one for each of the combined samples (AVT and CVT). Tables 2 and 
3 summarize the data of the item analyses . Table 2, which summarizes the data 
for the verbal tests, makes it clear that the mean test scores for the Canadian 
examinees are higher by l/5 to 1 A of a standard deviation than are the means 
for the American examinees. This difference may be observed on the items that 
were originally Canadian, and also on the items originally American. As expected, 
the item data support this observation; the mean deltas for the American samples 
are higher than the mean deltas for the Canadian samples and indicate that the 
items are more difficult for the Americans than for the Canadians . It. is also 
observed in both the Va and Vc items that the Canadian samples are more homogene- ; 
ous than the American samples. This finding is also supported by the standard 
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*Be cause of the difficulty in establishing the unique identities of some cases in Samples CVl and CV2, 40 
cases were dropped in the process of combining these samples to form Sample CVT. 
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deviations of deltas, which are larger for the Canadians than for the Americans.* 
To what extent these various observations can be generalized beyond the observed 
samples it is impossible to say. At the very most, the data indicate that the 
CSAT group tested in January 1971, from which the CSAT samples were randomly 
drawn, is more able on the verbal abilities measured by the Va and Vc items than 
the SAT group tested in January 1971* It is beyond the scope of these data to 
give evidence on the extent to which these groups are differently representative 
of their parent populations . 

The data on the biserial coefficients are interesting in that they indi- 
cate that items originally American are slightly more discriminating for the 
Americans (r^ = .^9 for AVT) than for the Canadians = .k-6 for CW), 

while items originally Canadian are slightly more discriminating for the 
Canadians (r^ = -55 for CVT) than for the Americans (r^ s = .52 for AVT). 

This finding in turn suggests the possibility that the items in the t>io tests 
may be homogeneous in content, but specific to the students for whom the items 
were intended. 

Table 5, which summarizes the data for the mathematical tests, repeats- 
some of the findings reported in Table 2. As in the verbal tests, the Canadian 
students score higher than their American counterparts by almost half a stan- 
dard deviation on both sets of items, those originally Canadian as well as 
those originally American. As expected, the means of the item deltas confirm 
these findings ; the mean deltas for the American students are higher than 
those for the Canadian students, indicating that the items are more difficult 

*In general, standard deviations of item difficulties are larger for more 
. homogeneous groups and smaller for more heterogeneous groups . .. 




for them than for the Canadian students. Also, as in the verbal tests, the 
Canadians are more homogeneous as a group than the Americans on both the Ma 
and Me subtests. This finding is also supported by the item data, which show 
a larger standard deviation of deltas for the Canadians. The data on item 
biserials are less clear for the mathematical items than they are for the 
verbal items. Although the item biserial correlations for mathematical are 
higher for the Americans on the items originally American, they are about 
equal for the two national groups on the items originally Canadian. 

Figures 1-5 present the delta plots for the verbal items. Figure 1 is 
the plot of all 95 items for one random sample of American students versus 
the other (AVI vs_. AV2); Figure 2 is the plot for the same set of 95 items 
for one random sample of Canadian students versus the other (CVl vs.. CV2), 
and Figure 5 is the plot for the same set of items for the combined sample of 
Americans versu s the combined sample of Canadians (AVT vs. CVT). It .is clear 
from these figures that the cross -national plot (Figure 5) shows a greater 
dispersion than does either of the wi thin-national plots (Figures 1 and 2). 
These observations are supported by the correlations represented by these 
plots (.976 for AVT vs. CVT, as compared with .990 for AVI vs . AV2 and .991 



for CVl vs. CV2) and also by the values of <7 (.4-58 for AVT vs. CVT, as com- 

pared with .287 for AVI vs. AV2 and .296 for CVl vs. CV2). 

Figures 4-6 present the corresponding delta plots for the 4-5 mathematical 



items: Figure 4- is a plot of the deltas for one random sample of Americans 
versus another (AMI vs. AM2); Figure 5 is a plot of the deltas for one random 
sample of Canadians versus another (CM1 vs. CM2), and Figure 6 is a plot of 
the deltas for the combined American sample versus the combined Canadian sample 
(AMT vs. CMT). Here too the cross -national plot (Figure 6) is more dispersed 
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Figure 1 
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Delta Plot of Verbal Items for 
Canadian Sample #1 vs Canadian Sample #2 
(CVl vs CV2) 
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Fig-ure 3 
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Total American Sample vs Total Canadian Sample 
(AVT vs CVT), 



r = .97 6 






**** 



* 



A A 

1 A A 



a a < 



JA A 



A* A *A 



A® 



• a a 



A A 



A AA 



A A A 
A 



Code *' 



a Canadian items 
• American items 
<$> Circled points represent 
two observations 



_L_ 



_L 






i-: I 



_L 



_L 



5.0 6.0 7.0 8.0 9.0 10.0 1 1.0 12.0 13.0 14.0 15.0 16.0 17.0 ! 8.0 19.0 2 

; group avt 



o 

ERIC 



37 



■16 



Figafe 4 



Delta Plot of Mathematical Items 
American Sample #1 vs American Sample #2 
(AMI vs AM2 ) 
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Delta Plot of Mathematical Items 
Canadian Sample #1 vs Canadian Sample #2 
(CM1 vs CM2) 
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than either of the wi thin-national plots (Figures and 5)> and, as expected, 
the statistics based on the plots bear out these observations. (The correla- 
tion for AMT vs_. CMT is .977^ as compared with the correlations of .991 for 
AMI vs. AM2 and .99^ for CM1 vs . CM2; the cr^ for AMT vs_. CMT is .415* 
as compared with the cf .233 for AMI vs.. AM2 and .2^+2 for CM1 vs . CM2.) 

The six distributions of D-values, corresponding respectively to Figures 1-6, 
are given in Table 4. 

. Although these differences are clear and indisputable, certain qualifi- 
cations are in order. The two sets of wi thin-national samples, it is recalled, 
were selected randomly, and the differences between them are as small as the 
method of random sampling and the sample size permit. If systematic samples 
(e.g_. , samples drawn from somewhat different geographical areas within the 
United States or within Canada) had been drawn, we would expect the resulting 
delta plots "to be considerably more dispersed than the observed random wi thin- 
national samples. In some instances, even larger differences in dispersion 
of deltas could result from geographical sampling for within -national groups 
than for the random cross -national samples observed here. Therefore, to draw 
the general conclusion that between-national comparisons are more dispersed 
than within-national comparisons is not warranted by these data. 

In any case, the dispersions observed in the cross -national plots are 



too small to warrant the conclusion that the American items, for example, are 
inappropriate for the Canadians, or vice versa. However, since this issue is 
of itself not central to the purpose V of the present study — which was to find 
the equations to be used for converting the scaled scores on the SAT to the 
scale for the CSAT-- it was not pursued in detail. Instead, the distributions 



of D-values for the .cross -national plots (A 



vs . CVT and AMT vs . CMT) were : 



O 
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Table 4 



Dis tr ibut ions 


of* D -Values 


for the 


Item Delta 


Plots 






Plots 


of* Verbal Items 


Plots of 


Mathematical Items 




AVI 


cvi 


avt 


AMI 


cm 


AMT 




VS • 


vs. 


vs. 


vs . 


vs • 


vs. 


D -Value 


AV2 


CV2 


CVT 


AM2 


CM2 


CMT 


1.2 






1 








1.1 






1 








1.0 






1 
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0.9 ■ 
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0.6 .'j 
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2 'V ; 


6 
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5 


6 


4 


2 
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5 


0.5 


11 


4 


5 


2 


k 


: 2 


0.2 


9 


15 


8 


8. 


3 


3 


0.1 


8 


16 


6 


5. 


10 


5 


0.0 


l4 


17 


10 


9 


k 


6 


0.1 


12 


8. 


8 


. 6 


7 


6 


O 

ro 


9 


5 


15 


4 


9 


1 


0.5 


12 


' 7 ■ 


4 


6 


3 


5 


0.4 


4 


4 : 


6 


l 


■ 1 ■ 


0 


0.5 


2 


5 '• 


5 O’. 


0 


1 


2 : 


0.6 


1 


2 


,■ i : ;■ 


1 




1 


6.7 


2 


1 


' 3,. c..'. 






2 


0.8 




1 


2 






1 


0.9 






1 






0 - 


1.0 






1 ••'ll/ 






0 


l.i 






i 






.. 1, . 


No. of Items 




• r\cz 






IlR 






93 




















0 0 




Mean 




■■ u. u 










S.D. 


.287 


.296 


.458 


.255 


.242 . ; 


a -415 ; 


Plot Correlation 


.990 


.991 : 


.976 


■ -991 ' 


•994 


• 977 
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examlned to exclude from the set of items used in equating the CSAT to the 
SAT those items that were considered to be re lat ive ly inappropriate to 
Americans or inappropriate to Canadians. These items were taken to be the 
"outliers" (i.e., those with high positive or high negative D-values), con- 
tributing heavily to the item x nationality interaction. Accordingly, the 
items removed from the verbal equating sections were those for which -0.55 ^ 

> 0.55; "the items removed from the mathematical equating section were 
those for which -0.52 > > 0.52. This process resulted in the removal of 

20 of the 95 Verbal items (9 Canadian and 11 American items) and 10 of the i+5 
Mathematical items (i|- Canadian and 6 American items). 

Equating 

All available Canadian examinees who had taken Section Va as their 
nonope rational section and all available Canadian examinees who had taken 
Section Ma as their nonoperational section, exclusive of those who had been 
part of the item analysis samples (CV1, CV2, CM1, or CM2), were selected for 
the equating process. This selection resulted in Canadian samples of 3--I-95 
for the verbal equating and 3315 for the mathematical equating. Corresponding 
American samples of 3^1-95 and 3320 (approximately the sizes of the Canadian 
samples) were selected at random from the available American examinees who had 
taken Section Ve or Me, exc lus ive (as in the choice of the Canadian samples) 
of al 1 those who had been part of the item analysis samples (AVI, AV2, AMI or 
AM2). 

The method used for equating the Canadian and American scales involved l 
first the determination of the equation relating the raw (R- W/k) scores on the 
operational American SAT to the raw (R - \t/k) scores cn the operational Canadian 



CSAT and substituting into that equation the relationship between raw scores and 
scaled scores for each of the two testing programs to yield a final relatxonshxp 
between American scaled scores and Canadian scaled scores. 

The initial conversion between raw scores on the two systems required 
first an estimation of the mean and variance for the combined American- 
Canadian sample on each of the two tests in accordance with a method developed 
by Tucker (Angoff, 1971.? P* 5^0 ), as follows: 



M = M + b (M - M. ) , 

x t x a v t 



( 1 ) 



^ 2 . 2.2,2 2 v 

s =s +b (s -s ) 
x t x a ^Oi V t a 



( 2 ) 



and 



where 



A 



= M + b 



y 



(M - M ) 



P 



V t 



S 2 = S 2 + b 2 (s 2 - s 2 ) 

y t y p yV v t v p 



( 3 ) 



00 



and 



x or X = the test (CSAT) taken by Group a (Canadian ), 
y or Y : = the test (SAT) taken by Group p (American), 

v = the equating test, i.e., scores on the items taken 

by both Group CC and Group p and used in the equating, 

t - the combined group, OC + P • 



The notation, b , represents the usual coefficient of regression of variable x 



on variable v: b^ = r^o^/qv 



v. b = r cr /cr .), The estimated 

* * yv yv y' v ' 



values, M , M , s , and s , were then substituted in the equation, 

■V y X V. 



y. 



o 
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( 5 ) 



to yield the equation., 

Y = aX + b , (6) 

converting raw scores on Test X to the scale of the raw scores on Test Y. 

In this equation , a = s /s and b = M - aM 

y t x t y t x t 

In order to calculate the estimated values given in equations (l) to (4) 
for the verbal tests , the correlation between the operational test and the 75 _ 
item equating section,* as well as the related means and standard deviations, 
were prepared for each of the two verbal samples, the sample consisting of the 
5495 American students and the sample consisting of the 3493 Canadian students. 
These statistics are given in Table 5. Distributions of the operational verbal 
and equating verbal tests for the SAT and CSAT aye given in Table 6. 

The application of the data in Table 5 to equations (l) to (4) resulted in 
the following estimated values: 

M » 50.6009 , 

x t 

M = 35.5304 , 

y t 

i 2 =257.0424 , 

x t ... ■ .... 

■ and 

S 2 = 289.9611 



*Re call that 20 verbal "outlier 1 ' items had been removed from the original group 
of 95 ite ms administered to both the American and the Canadian examinees. 
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Table 5. 

Correlations Between Operational and Equating Tests 
for American and Canadian Samples 



Verbal Tests 





American 

Sample 


Canadian 

Sample 


No. of Cases 


3495 


5495 


Operational Test* 
Mean 

Std. Dev. 


31.8710 

17.5004 


r 33.9969 
14.1405 


Equating Test** 

Mean 

Std. Dev* i 


i 

j; • 25 . 8^89 

15.1437 


32.5663 

13.3905 


Correlation: 

Opnl. vs. Equating 


.9431 


.9572 



-The operational SAT -verbal contained 90 items: the opera- 

tional CSAT -verbal contained 75 items. 



**The verbal equating test consisted of 75 items (after the 
removal of 20 "outlier ,r items). 
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Table 6 

Frequency Distributions for Operational and 
Equating Sections of the SAT and CSAT 



Raw 

(Formula) 

Score 



Verbal Tests 



American Sample 



Canadian 



Operational 

SAT 



Equating 

Section 



27 



Operational 

CSAT 



84 - 86 


2 






81 - 85 


5 






78 - 80 


11 






75 - 77 


11 ' 




. «. 


72 : - 7k 


20 




1 ' 


69 - 71 


29 


7 


10 


66 - 68 


52 


9 


50 


65 - 65 


56 


19 


50 


60 - 62 


6l 


28 


54 


57 - 59 


67 


55 


65 


5k - 56 


116 


52 


115 


51-55 


118 


72 


157 


48 - 50 


146 


88 


169 


45 - 47 


145 


97 


207 


42 - 44 


178 


154 


210 


59 - 4l 


221 


181 


255 


56 - 58 


205 


201 


26l 


55 - 55 


201 


2l8 


265 


50-52 


202 


244 


525 


27 - 29 


212 


267 


270 


24-26 


214 


251 


255 


21 - 25 


225 


257 


229 


18 - 20 


204 


254 


156 


15: - 17 


175 


221 


158 


12 - 14 


l6l 


196 


105 


9 - 11 : 


b 157 


210 


72 


6 - 8 


115 


i56C' : : 


= b-’CTf 57 


5 - 5 


92 


150 


45 


0 - 2 


48 


85 : 


9 


-5 - -1 


55 


50 


6 


-6 - -4 


16 


29 


b v . ,..3. 


-9 - -7 


1 


5 




-12 - -10 




1 




No. of Cases 


5495 


5495 


5495 


Mean 


51.8710 


25.8489 


55.9969 


Std. Dev. 


17.5004 


15.1457 


14.1405 












Sample 

Equating 

Section 



1 

10 

l4 

55 

45 

57 

79 

110 

127 

205 

206 
206 

297 
295 
510 
518 

298 
227 
195 
162 
110 

88 
51 
50 
: : 15 

: ' ;u; 4 r u 



1 

1 




52.5665 



15.5905 
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These values were applied to equation (5) to yield the equation, 

Y = 1.1060 X + 1 . 68 5T 0 > ■ (T ) 

which permitted the conversion of scores from the raw score scale of the CSAT- 
verbal to the raw score scale of the SAT -verbal. The derivation of the con- 
version from the CSAT -verbal reporting scale to the SAT -verbal reporting 
scale is developed as follows: 

The linear equation, 

S = AX + B , ( 8 ) 

c 

is the equation by which raw scores on the form of the CSAT used in January 
1971 (X) are converted to the CSAT reporting scale (Sc). Similarly, the 
linear equation, 

S = A’Y + B’ , (9) 

3 / 



is the equation by which raw scores on tbe form of tbe SAT used in January 

1971 (Y) are converted to the SAT scale (Sa). Expressing equation ( 8 ) in 

t S - p S * B 1 ! 

x = c A and equation ( 9 ) in terms of Y |^Y = \ a A «— J and 

J . . . . s a - B ’ J S c ~ B 1 

substituting in equation ( 6 ) results in the equation, _ a [_ A J + b 

which, when simplified, becomes 





+ A’b + B' 



aA ' B . (10) 



Equation (10) is a linear equation with slope equal to aA'/A and intercept 
equal to A'b + B' - , and may be used to convert verbal or mathematical 

scores from the January 1971 converted score scale for the CSAT to correspond 
ing scores on *bhe SAT scale* • '• 
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In order to derive the numerical conversion from the CS AT -verbal scale to 
the SAT -verbal scale, the following numerical values for the slopes and inter- 
cepts of equations (6), (8 )^ and ( 9 ) were applied to the constants in equation 

( 10 ): ; . -P _ . 

a =111060; b = I.6857 [from equation (6)) , 

A = 7.2555; B = 256.OOOO [from equation (8)) , and 

A' = 6.5656; B' = 229 . 2008 [ from equation (9)] . 

The resulting conversion equation for the verbal test is, therefore , 

s = 0.9706 s - 8.5528 . (11) 

a c 

The inverse equation,, for converting the SAT -verbal scale to the CSAT-verbal 



scale, is : 

S = 1.0505 S + 8.8II9 • (Ha) 

c a 



The following equivalencies were determined from equations (ll) and ( 11a ) : 



From Equation (ll): From Equation (lla): 



CSAT-verbal 

Score 


Equivalent 
SAT -verbal 
Score 


SAT -verbal 
Score 


Equivalent 

CSAT-verbal 

Score 


800 


768 


800 


(855)* 


700 


671. 


700 


750 


600 


57^ 


600 


627 


500 


. kll 


500 


= 52k 


1^00 


580 


t-00 


t-21 


500 


285 


;r ■ 500 


518 


200 


(186 )* 


200 


215 



More detailed conversion tables for the verbal tests are provided 



in Appendix I. 



*S cores lower than 200 or greater than 800 ■ on the SAT are reported as 200 or 800, 
respectively. The same procedures will be employed on all future forms of ishe CSAT- 




- 28 - 

Directly parallel procedures were followed in deriving the e equation for 
converting scaled scores on the CSAT— mathematical to scaled scores on the SAT 
mathematical. In order to calculate the estimated values given in equations 
(1) to (li) for the mathematical tests, the correlation between the operational 
test and the 35 _ H em equating section,* as well as the related means and 
stan dar d deviations, were prepared for each of the two mathematical samples, 
the sample consisting of the 3320 American students and the sample consisting 
of the 3315 Canadian students. These statistics are given in Table 7 . Dis- 
tributions of the operational mathematical and equating mathematical tests 
for the SAT and CSAT are given in Table 8. : 

The application of the data in Table 7 to equations (l) to (4) resulted 
in the following estimated values; 

M, « 15*7^79 

t 

M 26.2079 
y t 

. si = 88.7126 , 

; : ' ' '/ • x t ■■ . 

and ■ 

. . : il ■ S 2 = 162.9277 

. . , . V : .V : ; y t -• : • - ■ . ... 

These values were then applied to equation ( 5 ) to yield the equation, 

Y = 1.3552X + i ^.8663 , ( 12 ) 

which permitted the conversion of scores from the raw score scale of the CSAT- ■ 
ma thematical to the raw score scale of the SAT-mathematical. Ip. order to 
derive the conversion from the CSAT -mathematical scale to the SAT-mathematical 

* Re call that 10 mathematical "outlier" items had been removed from the original 
group of 15 ite ms administered to both the American and the Canadian examinees. 

ERIC i-,;:: . , - - v - v . • 'V- : 
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Table 7 

Correlations Between Operational and Equating Tests 
for American and Canadian Samples 



Mathematical Tests 





American 

Sample 


Canadian 

Sample 


No. of Cases 


3320 


3315 


Operational Test* 
Mean 

Std. Dev. 


22.8536 

15.1620 


18.2^77 

8.3968 


Equating Test** 
; Mean 
Std. Dev. 


12.0973 
8.5261 ~ 


16.7^93 

7.2081 


Correlation : 

Opnl . vs • Equating 


. 93^8 


.9219 



*The operational SAT -mathematical contains 60 items ; the 
operational C SAT -mathematical contains ^0 items. 

**The mathematical equating test consisted of 35 items 
(after the removal of 10 "outlier" items). 



Table 8 

Frequency Distributions for Operational and 
Equating Sections of the SAT and CSAT 

Mathematical Tests 




Raw 

(Formula) 


American 


Sample 


Canadian Sample 


Operational 


Equating 


Operational 


Equating 


Score 


SAT 


Section 


CSAT 


Section 


58 - 59 


8 








56 - 57 


8 








54 - 55 


19 








52 - 53 


13 








50 - 51 


31 








48-1*9 


39 








46 - 47 


42 








44 - 45 


70 








42 - 43 


78 








4o - 4i 


81 




4 




38 - 39 


105 




33 




36 - 37 


98 




23 




34 - 35 


135 


8 


80 


25 


32-33 


156 


24 


100 


24 


30 - 31 


170 


46 


125 


91 


28 - 29 


l64 


82 


175 


122 


26-27 


162 


91 


173 


149 


24 - 25 


180 


129 : 


200 


210 


22 - 23 


167 


161 


194 


260 


20-21 


190 


156 


255 


301 


18 - 19 


188 


221 ' ' 


306 


302 


16 - 17 


176 •: 


212 


304 


338 


14 - 15 


153 


234 


311 


368 


12 - 13 


,132 


26l 


: '.276 


313 


10-11 


179 


286 


. 257 


267 . 


8 - 9 


146 


293 


198 


- 213 


6 - 7 


. 100 


263 


"-V. 128 


' .136 


4 - 5 


121 : 


263 


89 1 : 


. 104 


2 - 3 


8l 


. 220 


48 


53 


0 - 1 


62 


C-\;y*2l£> 


25 r V, 


29 


-2 - -1 


7 35 


112-7 


' : 9 


,iv 6 


-4 - -3 


19 


32 


. 2 ■ 


. : : i : 3 , 


-6 - -5 


■ r-.r 7 


7 ; 7 . 




. -.''.1 :--x\ 


-8 - -7 . 


■. ■ 5 


0 






-10 - -9 




7 1 ' 




■ -A-. \ 


No, of Cases 


3320 


: 3320 


3315 


3315 : 


Mean 


22.8536 


12.0973 


18.2477 


16.7493 ; 


Std. Dev. 


13.1620 


8.5261 '7- : 7 


8.3968 


7.2081 7 j 






, - 31 - 

scale, the following numerical values from the slopes and intercepts of equa- 
tions (6), (7), and (8) were applied to the constants in equation (9): 

a = 1.3552; b =. 4 . 8663 [from equation (6)] , 

A = 13 - 9550 ; B = 242.0000 [from equation ( 8)3 , and 

A’ = 8 . 5922 ; B' = 273.2077 [from equation ( 9)3 . ' 

The resulting conversion equation for the mathematical test is, therefore, 

s = 0.8347 s + 113.0219 . ( 13 ) 

a c . 



The inverse equation, for converting the SAT -mathematical scale to the CSAT- 
mathematical scale, is : * c 1 

S = 1.1980 S - 135 - 4042 . ( 13 a) 

C a 

The following equivalencies were determined from equations (l3 ) and (15a): 

From Equation 



CSAT -math 
Score 


Equivalent . 

SAT -math 

: . • Sc ore -l l ,'. 


SAT -math 
Score 


Equivalent 
C S AT -math 
. Score 


8 00 


781 


800 


(823)* 


700 


; 697 • Iv-/..-;"''? 


0 

0 


703 


600 


; 6 l 4 7 : '7 : -". 


600 


583 , 


500 7 


v 530 


: 500 


464 , 


400 


447 ' 


4 00 


344 


300 


363 


• 300 7 


224 


200 7 


280 4.: 


200 


( 104 )* 



More detailed, conversion tables for the inathematical tests are provided in 



Appendix II. 



^Scores lower 
respectively., 



r'.l 



than 200 : or . greater •; than 8 00 will be yepoi^ted as 200 ; and > 8p0’^; 
on all future forms of the CSAT. ■> The same procedures are employed 



O 

ERIC 



on all forms of the SAT. 
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The incidental observation is made that the verbal and mathematical tests 
of the CSAT and the SAT are all quite similar in average difficulty. The 
est ima ted formula score means for the verbal tests for the combined SAT-CSAT 
verbal sample both represent a mean item difficulty of about 0.4 (0.4-1 for 
CSAT -verbal and 0.59 for SAT -verbal) in terms of the same combined verbal 
equating group. The estimated formula score means for the mathematical tests 
show a slightly greater difference; the mean item difficulty on CSAT -mathematical 
estimated for the combined mathematical equating group is 0.59 while the mean 
item difficulty on SAT -mathematical for the same group is slightly greater -- 
0.4-5 -- but still very much in the same range. Although the reason for this 
difference can only be a matter for speculation, it is possible that part of it, 
at least, is attributable to the more generous timing allowances in the CSAT. 

Ratios of the number of items to the estimated standard deviation for 
the combined samples were also obtained. These ratios were 4 -. 87 for CSAT- 
verbal and 5.29 for SAT -verbal, and 4 -. 25 for CSAT -mathematical and 4-. 70 for 
SAT -mathematical. These figures suggest the possibility that the mathematical 
tests yield more discriminations and may therefore be more reliable per unit 
test than the verbal tests. . The figures also suggest the possibility that the 
tests of the CSAT are slightly more reliable than those of the SAT per unit test. 

Conversion Results 

From equations (ll) or (lla) and (15) or (l5a), and from the conversion 
tables that follow those equations it is clear that, although the Canadian and 
American scales are similar in the region of the 500-score, there are detailed, 
differences. The- equivalent SAT -verbal scores are lower than the corresponding 
CSAT -verbal scorers throughout the scaled score range, the differences ranging 
from -14- points at the CSAT -verbal score of 200 to -52 points at the CSAT -verbal 
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score of 800. At 500 the difference is midway between these extremes, -25 
points . 

The differences in the mathematical scales are greater, however, and for 
the most part in the opposite direction. At the CSAT -mathematical scaled score 
of 200 , the difference is as much as 80 points; thus, a student with a score at 
the 200 -score level on CSAT is estimated to be equivalent in ability to a 
student with a 280 score on SAT. At the opposite end of the scale, the differ- 
ence is much s mall er, but in the other direction; a student at the 800 -score 
level on the CSAT scale would have a score equivalent to a score less than 800 
on the SAT scale, namely 781. As expected, the difference at 500 is about mid- 
way between these; a score of 500 on CSAT-mathematical is equivalent to a score 
of 530 on SAT -mathematical. 

Summary 

In this equating study the form of the CSAT administered in January 1971 
was equated to the form of the SAT also administered in January 1971- The 
design for equating involved the administration of a one -half hour operational 
verbal section and a one-half hour operational mathematical section of the 
CSAT as "exper ime ntal” (nonoperational) tests to random samples of the January 
1971 SAT group. Similarly, a one -half hour operational verbal section and a 
one-half hour operational mathematical section of the SAT were administered as 
"experimental" (nonoperational) tests to random samples of the January 1971 
CSAT group. As a result of these symmetrical "cross-over" administrations, . 
random samples of the American and the Canadian groups took the same two 
sections of verbal test materials. Other random samples of the American and 
Canadian groups took the same two sections of mathematical test materials . 

These common sections, representing 95 verbal items and mathematical items. 



were then item analyzed, and item difficulty (delta) plots were made to identify 
"outlier" items — items that were not equally appropriate for the two groups of 
examinees. As a result of these analyses, 20 verbal' and 10 mathematical items 
were identified as "outliers" and removed from the equating sections. Data on 
new samples were developed with the remaining 75 verbal equating items and 55 
mathematical equating items to equate, raw scores on that form of the CSAT to 
corresponding scores on that form of the SAT. With these equations in hand and 
with the equations converting each set of raw scores to its own reporting scale, 
conversion equations were developed relating the verbal and mathematical scales 
for the CSAT to the corresponding scales for the SAT. 
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